Extending Classical Planning Heuristics to Probabilistic Planning with Dead-Ends
Authors
Abstract
Recent domain-determinization techniques have been very successful in many probabilistic planning problems. We claim that traditional heuristic MDP algorithms have been unsuccessful mostly because of the lack of efficient heuristics in structured domains. Previous attempts like mGPT applied classical planning heuristics to an all-outcome determinization of MDPs without a discount factor; yet discounted optimization is required to solve problems with potential dead-ends. To overcome this flaw, we propose a general extension of classical planning heuristics to goal-oriented discounted MDPs. We apply our theoretical analysis to the well-known classical planning heuristics hmax and hadd, and prove that the extended hmax is admissible. We plugged our extended heuristics into popular graph-based (Improved-LAO∗, LRTDP, LDFS) and ADD-based (sLAO∗, sRTDP) MDP algorithms: experimental evaluations show competitive results compared with the winners of previous competitions (FF-REPLAN, FPG, RFF), and show that our discounted heuristics solve more problems than non-discounted ones, with better criterion values. As in classical planning, the extended hadd outperforms the extended hmax on most problems.
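For readers unfamiliar with the baseline that the abstract builds on, the following is a minimal sketch of the classical (non-discounted) hmax and hadd heuristics computed on an all-outcome determinization with delete effects ignored. It is not the discounted extension proposed in the paper, which the abstract does not specify. The action encoding, the unit action costs, the toy domain, and all function and fact names are assumptions made for illustration only.

```python
from math import inf

def all_outcome_determinization(prob_actions):
    """One deterministic action per probabilistic outcome; probabilities are dropped."""
    det = []
    for name, pre, outcomes in prob_actions:
        for i, (_prob, add) in enumerate(outcomes):
            det.append((f"{name}#{i}", frozenset(pre), frozenset(add)))
    return det

def relaxed_heuristic(state, goal, det_actions, combine):
    """Fixpoint over fact costs in the delete relaxation (unit action costs).

    combine=max yields hmax, combine=sum yields hadd.
    """
    facts = set(state) | set(goal)
    for _, pre, add in det_actions:
        facts |= pre | add
    cost = {f: (0 if f in state else inf) for f in facts}
    changed = True
    while changed:
        changed = False
        for _, pre, add in det_actions:
            pre_cost = combine([cost[p] for p in pre]) if pre else 0
            if pre_cost == inf:
                continue  # action not yet reachable in the relaxation
            for f in add:
                if 1 + pre_cost < cost[f]:
                    cost[f] = 1 + pre_cost
                    changed = True
    return combine([cost[g] for g in goal]) if goal else 0

# Hypothetical toy domain: (name, preconditions, [(probability, add effects), ...])
actions = [
    ("pick", {"at_a"}, [(0.8, {"holding"}), (0.2, set())]),
    ("move", {"at_a"}, [(0.9, {"at_b"}), (0.1, {"broken"})]),
    ("drop", {"holding", "at_b"}, [(1.0, {"delivered"})]),
]
det = all_outcome_determinization(actions)
goal = {"delivered", "at_b"}

h_max = relaxed_heuristic({"at_a"}, goal, det, max)
h_add = relaxed_heuristic({"at_a"}, goal, det, sum)
h_dead = relaxed_heuristic({"broken"}, goal, det, max)
```

In this toy domain, a state from which no goal fact is reachable in the relaxation (here `{"broken"}`) receives an infinite estimate, which illustrates the kind of dead-end state the abstract refers to.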
Similar Resources
Towards Clause-Learning State Space Search: Learning to Recognize Dead-Ends
The ability to learn from conflicts is a key algorithm ingredient in constraint satisfaction (e.g. [6, 24, 20, 22, 8, 2]). For state space search, like goal reachability in classical planning which we consider here, progress in this direction has been elusive, and almost entirely limited to length-bounded reachability, where reachability testing reduces to a constraint satisfaction problem, ye...
Goal Probability Analysis in Probabilistic Planning: Exploring and Enhancing the State of the Art
Unavoidable dead-ends are common in many probabilistic planning problems, e.g. when actions may fail or when operating under resource constraints. An important objective in such settings is MaxProb, determining the maximal probability with which the goal can be reached, and a policy achieving that probability. Yet algorithms for MaxProb probabilistic planning are severely underexplored, to the ...
Unsolvability Certificates for Classical Planning
The plans that planning systems generate for solvable planning tasks are routinely verified by independent validation tools. For unsolvable planning tasks, no such validation capabilities currently exist. We describe a family of certificates of unsolvability for classical planning tasks that can be efficiently verified and are sufficiently general for a wide range of planning approaches includi...
SixthSense: Fast and Reliable Recognition of Dead Ends in MDPs
The results of the latest International Probabilistic Planning Competition (IPPC-2008) indicate that the presence of dead ends, states with no trajectory to the goal, makes MDPs hard for modern probabilistic planners. Implicit dead ends, states with executable actions but no path to the goal, are particularly challenging; existing MDP solvers spend much time and memory identifying these states....
Integrating Planning, Execution, and Learning to Improve Plan Execution
Algorithms for planning under uncertainty require accurate action models that explicitly capture the uncertainty of the environment. Unfortunately, obtaining these models is usually complex. In environments with uncertainty, actions may produce countless outcomes and hence, specifying them and their probability is a hard task. As a consequence, when implementing agents with planning capabilitie...